Maximum Likelihood Estimation for Lossy Data Compression∗

نویسندگان

  • Matthew Harrison
  • Ioannis Kontoyiannis
چکیده

In lossless data compression, given a sequence of observations (Xn)n≥1 and a family of probability distributions {Qθ}θ∈Θ, the estimators (θ̃n)n≥1 obtained by minimizing the ideal Shannon code-lengths over the family {Qθ}θ∈Θ, θ̃n := arg min θ∈Θ [ − logQθ(X 1 ) ] , whereXn 1 := (X1, X2, . . . , Xn), coincide with the classical maximum-likelihood estimators (MLEs). In the corresponding lossy compression setting, the ideal Shannon code-lengths are approximately − logQθ(B(X 1 , D)) bits, where B(Xn 1 , D) is the distortion-ball of radius D around the source sequence Xn 1 . In this work we consider the analogous estimators obtained by minimizing these lossy code-lengths, θ̂n := arg min θ∈Θ [ − logQθ(B(X 1 , D)) ] . The θ̂n are a lossy version of the MLEs, which we call “lossy MLEs”. We investigate the strong consistency of lossy MLEs when the Qθ are i.i.d. and the sequence (Xn)n≥1 is stationary and ergodic.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Analysis of Color Image Transmission over Rayleigh Fading Channel in an Outdoor Environment

In this paper we investigate the impact of different equalization techniques for digital communication systems that employ MPSK and MQAM modulation technique to transmit information such as color image over Rayleigh fading AWGN channel. Performance is analyzed in terms of bit error rate by MATLAB simulation. For the simulation of Rayleigh fading channel the different property values such as pat...

متن کامل

Estimation of the Rate-Distortion Function

Motivated by questions in lossy data compression and by theoretical considerations, this paper examines the problem of estimating the rate-distortion function of an unknown (not necessarily discretevalued) source from empirical data. The main focus is the behavior of the so-called “plug-in” estimator, which is simply the rate-distortion function of the empirical distribution of the observed dat...

متن کامل

A Minimum Description Length Proposal for Lossy Data Compression

We give a development of the theory of lossy data compression from the point of view of statistics. This is partly motivated by the enormous success of the statistical approach in lossless compression, in particular Rissanen’s celebrated Minimum Description Length (MDL) principle. A precise characterization of the fundamental limits of compression performance is given, for arbitrary data source...

متن کامل

The Convergence of Lossy Maximum Likelihood Estimators

Given a sequence of observations (Xn)n≥1 and a family of probability distributions {Qθ}θ∈Θ, the lossy likelihood of a particular distribution Qθ given the data Xn 1 := (X1,X2, . . . ,Xn) is defined as Qθ(B(X 1 ,D)), where B(Xn 1 ,D) is the distortion-ball of radius D around the source sequence X n 1 . Here we investigate the convergence of maximizers of the lossy likelihood.

متن کامل

Second-order properties of lossy likelihoods and the MLE/MDL dichotomy in lossy compression

This paper develops a theoretical framework for lossy source coding that treats it as a statistical problem, in analogy to the approach to universal lossless coding suggested by Rissanen’s Minimum Description Length (MDL) principle. Two methods for selecting efficient compression algorithms are proposed, based on lossy variants of the Maximum Likelihood and MDL principles. Their theoretical per...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002